DNN-Based Cepstral Excitation Manipulation for Speech Enhancement
نویسندگان
چکیده
منابع مشابه
Integration of DNN based speech enhancement and ASR
Speech enhancement employing Deep Neural Networks (DNNs) is gaining strength as a data-driven alternative to classical Minimum Mean Square Error (MMSE) enhancement approaches. In the past, Observation Uncertainty approaches to integrate MMSE speech enhancement with Automatic Speech Recognition (ASR) have yielded good results as a lightweight alternative for robust ASR. In this paper we thus exp...
متن کاملAn hmm-based cepstral-domain speech enhancement system
This paper describes a method of enhancing speech corrupted by additive uncorrelated noise. The approach adopted is to use cepstral-domain hidden Markov models to determine statistics of the clean speech and noise processes. A compensated model of speech corrupted by noise is generated using parallel model combination. MMSE and linear non-homogeneous estimators of the clean speech signal are de...
متن کاملImproved Time-Frequency Trajectory Excitation Vocoder for DNN-Based Speech Synthesis
We investigate an improved time-frequency trajectory excitation (ITFTE) vocoder for deep neural network (DNN)-based statistical parametric speech synthesis (SPSS) systems. The ITFTE is a linear predictive coding-based vocoder, where a pitch-dependent excitation signal is represented by a periodicity distribution in a time-frequency domain. The proposed method significantly improves the paramete...
متن کاملDNN-Based Feature Enhancement Using Joint Training Framework for Robust Multichannel Speech Recognition
Ever since the deep neural network (DNN) appeared in the speech signal processing society, the recognition performance of automatic speech recognition (ASR) has been greatly improved. Due to this achievement, the demands on various applications in distant-talking environment also have been increased. However, ASR performance in such environments is still far from that in close-talking environme...
متن کاملNormalized Features for Improving the Generalization of DNN Based Speech Enhancement
Enhancing noisy speech is an important task to restore its quality and to improve its intelligibility. In traditional non-machine-learning (ML) based approaches the parameters required for noise reduction are estimated blindly from the noisy observation while the actual filter functions are derived analytically based on statistical assumptions. Even though such approaches generalize well to man...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE/ACM Transactions on Audio, Speech, and Language Processing
سال: 2019
ISSN: 2329-9290,2329-9304
DOI: 10.1109/taslp.2019.2933698